Filters developed to detect short bursts of gravitational waves in the output of interferometric detectors
are compared according to three main criteria. Conventional Receiver Operating Characteristics
(ROC) are first built for all the considered filters and for three typical burst signals. Optimized
ROC are shown for a simple pulse signal in order to estimate the best detection efficiency of the
filters in the ideal case, while realistic ones, obtained with filters working with several "templates",
show how detection efficiencies can be degraded in a practical implementation. Secondly, estimates
of the biases and statistical errors on the reconstructed time of arrival of pulse-like signals
are given for each filter. Such results are crucial for future coincidence studies between gravitational
wave detectors, but also with neutrino or optical detectors. As most of the filters require
a pre-whitening of the detector noise, the sensitivity to an imperfect noise whitening procedure
is finally analysed. For this purpose, lines of various frequencies and amplitudes are added to
Gaussian white noise and the outputs of the filters are studied in order to monitor the excess of
false alarms induced by the lines. The comparison of the performances of the different filters finally
shows that they are complementary rather than competitive.
PACS numbers: 04.80.Nn, 07.05.Kf

I. INTRODUCTION 

Long baseline interferometric detectors of gravitational waves (GW) [1-4] are currently taking their first data. The
preparation for data analysis of compact binary inspiral signals, the most promising source of GW to date, with these
new instruments has been in progress for a long time, as well as for periodic sources (see e.g. [5] for a review). The



star collapse either with neutron star (NS) [6-12] or black hole (BH) formation [13]. The duration of such events is 



at most a few milliseconds and the simulated amplitudes do not exceed a few $10^{-23}$ (NS formation) or $10^{-22}$ (BH
formation) for sources located at about 10 Mpc. With these typical amplitudes, interferometric detectors of the first 
generation have no hope of "seeing" such events if they occur beyond the Galaxy [12,14]. Other sources of GW bursts 
are NS binary [15-20] or BH binary mergers [21-25], for which a large amount of effort is currently underway in 
order to predict plausible waveforms. More exotic, but detectable by first generation interferometers, are possible 
GW bursts emitted by cosmic strings [26]. 

All the predicted sources of GW bursts are in fact characterised by a very rough knowledge of the emitted waveforms. 
Unless simulations of core collapse or binary mergers can provide accurate waveforms (which currently seems doubtful),
the use of matched filtering, which would be the optimal method in case of perfect knowledge of the waveforms, is
not possible. Sub-optimal filtering methods are then required for detecting GW bursts. Such methods have been
developed in different groups in the last few years. The Excess Power Monitoring has been built upon in successive
versions [27-30]. Time-frequency methods are also planned for burst signal or noise non-stationarity detection [31-34].
In our group, we have developed filtering methods with the idea of being as robust as possible with respect to the 
possible signal waveforms [14,35-37]. We have also introduced a benchmark test in order to compare different filters 
[14] in a given situation. This benchmark is however incomplete if we want to fully understand the different methods. 
The goal of this paper is then to compare the different proposed filters. A filter based on moving average [38] is 
added to the bank of previously published filters. The definitions of the filtering methods are first recalled. We 
intentionally discard some of them, such as Bin Counting [14] or the Norm of the Autocorrelation [39], which are 
clearly not competitive compared to the others. The efficiency curves (detection probability vs false alarm rate) of 
the different filters for generic burst signals with various signal-to-noise ratios (SNR) are then computed. We first 
use optimal versions of the filter (in the sense that the filter parameters are matched to the signal) and secondly 
realistic implementations for each filter, with different "templates" working in parallel. In the following part, the 
timing properties of the filters (bias and statistical timing accuracies) are studied with respect to both signal SNR 
and signal width. The sensitivity of the filters to the noise whitening quality is finally discussed. 



II. FILTERS FOR DETECTING GW BURSTS

In this section, we first describe the noise model used throughout the paper, and then list and briefly
describe the filtering methods.

A. The noise model

Throughout the paper, we assume that the noise is Gaussian and white with zero mean. The standard deviation
of the noise is then:

$$\sigma_n = \sqrt{\frac{S_h f_0}{2}} \qquad (2.1)$$

where $f_0$ is the sampling frequency and $S_h$ is the one-sided spectral density of the noise. For numerical examples,
we take $f_0 = 20$ kHz (Virgo sampling rate) and $\sqrt{S_h} \simeq 4 \times 10^{-23} / \sqrt{\mathrm{Hz}}$, which is about the minimum value of the
foreseen noise spectral density of the Virgo interferometer [40]; this choice is appropriate since the minimum is located in
the frequency range for the expected burst sources of GW. The choice of Gaussian noise is not essential,
but simply convenient for the design of the filters. Deviations from Gaussianity will produce, for example, an excess in
the rate of false alarms, and it will then be possible to retune the algorithm thresholds according to the
real noise statistics. In the frequency range of interest, above a few 100 Hz, the Virgo sensitivity curve is rather
flat, although not exactly white. Most of the filtering methods presented here and in [14] require a whitening of the
noise [41,42], which is foreseen for the output of the Virgo data processing. In the following, we normalise the noise by
its standard deviation, so that we are dealing with Gaussian noise with zero mean and unit standard deviation. We
also denote by $x_i$ the data sampled at times $t_i = i/f_0$. Let us also recall that we conventionally define the Signal to Noise Ratio
(SNR) after filtering as (filter output $- m$)/$\sigma$, where $m$ and $\sigma$ are the mean and standard deviation of the filter
output in the absence of a signal. Of course, an SNR can have different dimensionality depending on whether the filter
is linear or quadratic, for instance. This shows at least that the SNR is an ambiguous criterion when we are interested
in comparing different filters, linear or not.
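As an illustration of this noise model, the following minimal Python sketch evaluates the noise standard deviation for the numerical values quoted above and generates normalised noise. It assumes the usual relation between the standard deviation of sampled white noise and a one-sided spectral density, $\sqrt{S_h f_0 / 2}$; the function names are hypothetical and only serve the example.

```python
import numpy as np

F0_HZ = 20_000.0   # sampling frequency quoted in the text (Virgo)
SQRT_SH = 4e-23    # sqrt of the one-sided spectral density, in 1/sqrt(Hz)


def noise_sigma(sqrt_sh: float, f0: float) -> float:
    """Standard deviation of sampled white noise, assuming sigma = sqrt(S_h * f0 / 2)."""
    return sqrt_sh * np.sqrt(f0 / 2.0)


def normalised_noise(n_samples: int, seed: int = 0) -> np.ndarray:
    """Zero-mean, unit-variance Gaussian white noise (the normalised data x_i)."""
    rng = np.random.default_rng(seed)
    return rng.standard_normal(n_samples)


def filter_snr(output, mean_noise_only, std_noise_only):
    """SNR after filtering: (filter output - m) / sigma, both measured on noise only."""
    return (np.asarray(output) - mean_noise_only) / std_noise_only
```

With the quoted values, `noise_sigma(SQRT_SH, F0_HZ)` is about $4 \times 10^{-21}$, which is the factor removed by the normalisation.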

B. The Norm Filter 

The Norm Filter [14] is a simple version of the Excess Power statistics [27]. It is based upon a monitoring of the 
local signal energy in a moving window 

$$y_k = \sum_{i=k}^{k+N-1} x_i^2 \qquad (2.2)$$

In this form, the filter appears to be non-linear with a single parameter, the moving window size N. In the presence
of noise only, $y_k$ is distributed as a $\chi^2$ variable with N degrees of freedom (mean N and standard deviation $\sqrt{2N}$).
The variable

$$y_k^{NF} = \sqrt{2 \sum_{i=k+1}^{k+N} x_i^2} - \sqrt{2N - 1} \qquad (2.3)$$

can be very well approximated by a standard normal variable if $N \gtrsim 30$ [43]. This is the definition of the Norm Filter
(NF). We note of course that $y_k^{NF}$ is normalised so that it is indeed an SNR. The response of the NF to a test signal is
displayed in Figure 1. The NF is able to recover about 70% of the optimal SNR in this example.
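A minimal sketch of the Norm Filter, assuming data already normalised to unit variance, could look as follows. The cumulative-sum trick is an implementation choice of this example (not taken from the text), and windows are indexed from their first sample.

```python
import numpy as np


def norm_filter(x: np.ndarray, n: int) -> np.ndarray:
    """Normalised Norm Filter output for every window of n consecutive samples.

    Returns sqrt(2 * window energy) - sqrt(2n - 1), which is close to a
    standard normal variable on pure unit-variance Gaussian noise when n >~ 30.
    """
    csum = np.concatenate(([0.0], np.cumsum(x * x)))
    energy = csum[n:] - csum[:-n]          # sum of x_i^2 over each window
    energy = np.maximum(energy, 0.0)       # guard against rounding below zero
    return np.sqrt(2.0 * energy) - np.sqrt(2.0 * n - 1.0)
```

On pure noise the output should have a mean near 0 and a standard deviation near 1, so a threshold can be read directly as an SNR.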
C. The Slope filters and ALF



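A family of filters based upon fitting a straight line to the data has been proposed [37]. The two (non-independent)
results of the fit, namely the line slope a and the offset value b,

$$a = \frac{\langle t x \rangle - \langle t \rangle \langle x \rangle}{\langle t^2 \rangle - \langle t \rangle^2} \qquad (2.4)$$

$$b = \langle x \rangle - a \langle t \rangle = \langle x \rangle - \frac{\langle t x \rangle - \langle t \rangle \langle x \rangle}{\langle t^2 \rangle - \langle t \rangle^2} \langle t \rangle \qquad (2.5)$$

where $\langle x \rangle = \frac{1}{N} \sum_{i=1}^{N} x_i$ and $t_i = i/f_0$, can be used as linear filters with variances

$$\sigma_a^2 = \frac{12 f_0^2}{N (N^2 - 1)} \qquad (2.6)$$

$$\sigma_b^2 = \frac{4N + 2}{N (N - 1)} \qquad (2.7)$$

The normalized Slope and Offset Filters, namely $y^{SF} = a/\sigma_a$ and $y^{OF} = b/\sigma_b$, can be uncorrelated by diagonalisation
of the covariance matrix, yielding

$$y_\pm = \frac{y^{SF} \pm y^{OF}}{\sqrt{2 (1 \pm \alpha)}} \qquad (2.8)$$

with correlation coefficient

$$\alpha = - \sqrt{\frac{3}{2}} \sqrt{\frac{N + 1}{2N + 1}}$$

The two uncorrelated filters can finally be combined and the ALF (non-linear) filter is obtained:

$$y^{ALF} = (y_+)^2 + (y_-)^2 = \frac{(y^{SF})^2 + (y^{OF})^2 - 2 \alpha \, y^{SF} y^{OF}}{1 - \alpha^2} \qquad (2.9)$$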

In the presence of noise only, ALF is well approximated by a $\chi^2$-distributed random variable with 2 degrees of freedom,
hence a mean and a standard deviation both equal to 2. The only parameter is N in all cases. Again, simple recurrence
relations can be used in the successive calculations of the SF and OF (and so ALF) outputs. The responses of SF,
OF and ALF to a test signal are shown in Figure 1. The moving window size for each filter is optimally chosen to be
N = 140, that is about 7 times the signal half-width, as stated in [37]. We see that OF performs better than SF in detecting
the signal in this example. As already noticed, ALF (quadratic filter) and the other (linear) filters do not
have the same "dimension", since ALF outputs are proportional to the energy of the signal and not to its amplitude,
hence the much larger SNR for ALF.
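The construction of SF, OF and ALF can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the data have unit variance, times are expressed in units of the sampling period ($f_0 = 1$) with $t_i = 1, \dots, N$ inside each window, and the slope and offset variances and their correlation coefficient are the standard least-squares results for that convention. The function name is hypothetical, and the direct correlation used here replaces the recurrence relations mentioned in the text.

```python
import numpy as np


def slope_offset_alf(x: np.ndarray, n: int):
    """Return the normalised SF, OF and the ALF outputs for each n-sample window."""
    t = np.arange(1, n + 1, dtype=float)                  # t_i in sampling periods
    mean_x = np.correlate(x, np.ones(n), mode="valid") / n
    mean_tx = np.correlate(x, t, mode="valid") / n
    mean_t, mean_t2 = t.mean(), np.mean(t * t)

    a = (mean_tx - mean_t * mean_x) / (mean_t2 - mean_t ** 2)   # fitted slope
    b = mean_x - a * mean_t                                     # fitted offset

    sigma_a = np.sqrt(12.0 / (n * (n * n - 1.0)))         # slope std (f0 = 1)
    sigma_b = np.sqrt((4.0 * n + 2.0) / (n * (n - 1.0)))  # offset std
    sf, of = a / sigma_a, b / sigma_b

    # Correlation coefficient between the normalised slope and offset.
    alpha = -np.sqrt(3.0 * (n + 1.0) / (2.0 * (2.0 * n + 1.0)))
    y_plus = (sf + of) / np.sqrt(2.0 * (1.0 + alpha))
    y_minus = (sf - of) / np.sqrt(2.0 * (1.0 - alpha))
    return sf, of, y_plus ** 2 + y_minus ** 2
```

On pure Gaussian noise, SF and OF should each have unit variance and the ALF output should average to 2, as expected for a $\chi^2$ variable with 2 degrees of freedom.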
D. The Mean Filter



We now introduce the Mean Filter (MF), which is nothing but filtering by a moving average, or boxcar average
[38]. The Mean Filter computes the mean of the data in a moving window:

$$y_k^{MF} = \frac{1}{N} \sum_{i=k}^{k+N-1} x_i$$

It is a linear filter with a single parameter, the window size N. In the presence of noise only, $y_k^{MF}$ is distributed as a
Gaussian random variable with zero mean and standard deviation $1/\sqrt{N}$. The computation of the filter outputs is
very fast, as trivial recursive relations between $y_k^{MF}$ and $y_{k+1}^{MF}$ can be used. The moving window can therefore be allowed to
move bin by bin, without concern for CPU time. The response of the MF to a test signal is displayed in Figure 1.
We see that the MF is able to recover almost all of the optimal SNR in this example.
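The recursive update mentioned above can be sketched as follows: each new output is obtained from the previous one by adding the sample entering the window and subtracting the sample leaving it. The function names are hypothetical, and the explicit Python loop is kept only to make the recursion visible.

```python
import numpy as np


def mean_filter(x: np.ndarray, n: int) -> np.ndarray:
    """Moving average over n samples, updated recursively bin by bin."""
    out = np.empty(len(x) - n + 1)
    acc = float(np.sum(x[:n]))          # running sum of the current window
    out[0] = acc / n
    for k in range(1, len(out)):
        # Window k covers samples k .. k+n-1: add the new one, drop the old one.
        acc += x[k + n - 1] - x[k - 1]
        out[k] = acc / n
    return out


def mean_filter_snr(x: np.ndarray, n: int) -> np.ndarray:
    """MF output divided by its noise-only standard deviation 1/sqrt(n)."""
    return np.sqrt(n) * mean_filter(x, n)
```

The cost per output bin is constant, independent of the window size, which is why the window can be moved bin by bin.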

A first means of comparing the filters is to benchmark them under common conditions, i.e. subjecting them to the
same GW signals in identical noise conditions. We first used a benchmark based on a catalogue of 78 supernova
signals, simulated by Zwerger and Müller (ZM) [8,44] in the axisymmetric case (see [14] and [37] for details).

The results of this first benchmark for the different filters are collected in Table 1. We recall those already shown
in Ref. [37] and add the new one for the Mean Filter. We note that the performance of MF is similar to that of ALF.



Filter                   Optimal   NF     SF     OF     ALF    MF
Average distance (kpc)   27.4      11.5   11.3   15.2   22.5   20.0
Performance              1         0.46   0.49   0.59   0.81   0.78

Table 1: Detection distances and performances of the different filters in detecting a sample of supernova signals.
NF = Norm Filter, MF = Mean Filter, SF = Slope Filter, OF = Offset Filter. The results are extracted from Ref.
[37], except for the new MF result.

III. EFFICIENCY OF THE FILTERS 

Unfortunately, the previous benchmark gives only a partial view of the filters' performances, as it is computed for a
specified false alarm rate. Of course, one would wish to extend the comparison of filters to other false alarm rates,
especially those lying in the range likely to be allowed during science runs of the interferometers. A standard tool
in signal processing serves this purpose: the so-called Receiver Operating Characteristics (ROC), which display the detection
efficiency versus the false alarm rate. In the following, we compute the ROC for typical (albeit of course arbitrary)
burst signals. This will complete our understanding of the detection power of the filters.
A. Methodology



For each filter, we compute the ROC for three distinct typical burst-like signals. The first is a Gaussian pulse of
half-width w = 1 ms of the form

$$s(t) = A \exp\left( - \frac{(t - t_0)^2}{2 w^2} \right) \qquad (3.1)$$

The second is a damped sinusoid of frequency f = 1 kHz and damping time $\tau$ = 1 ms of the form

$$s(t) = A \, H(t - t_0) \exp\left( - (t - t_0)/\tau \right) \sin\left( 2 \pi f (t - t_0) \right) \qquad (3.2)$$

where H(x) is the Heaviside step function (H(x) = 0 if x < 0 and H(x) = 1 if x > 0).
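The two analytic test signals can be generated with a short Python sketch. Two assumptions are made here and should be read as such: the Gaussian pulse is taken as $\exp(-(t - t_0)^2 / (2 w^2))$, and the amplitude is set by requiring that the matched-filter SNR in unit-variance white noise, $\sqrt{\sum_i s_i^2}$, equals a chosen value. All function names are hypothetical.

```python
import numpy as np

F0_HZ = 20_000.0  # sampling frequency used for the numerical examples


def gaussian_pulse(n_samples, t0, w, f0=F0_HZ):
    """Unit-amplitude Gaussian pulse centred on t0 (assumed form, half-width w)."""
    t = np.arange(n_samples) / f0
    return np.exp(-((t - t0) ** 2) / (2.0 * w ** 2))


def damped_sine(n_samples, t0, freq, tau, f0=F0_HZ):
    """Unit-amplitude damped sinusoid starting at t0 (zero before and at t0)."""
    t = np.arange(n_samples) / f0 - t0
    s = np.zeros(n_samples)
    on = t > 0.0                                   # Heaviside step H(t - t0)
    s[on] = np.exp(-t[on] / tau) * np.sin(2.0 * np.pi * freq * t[on])
    return s


def calibrate(s, rho0):
    """Rescale s so that its optimal SNR in unit-variance white noise is rho0."""
    return s * rho0 / np.sqrt(np.sum(s ** 2))
```

For instance, `calibrate(gaussian_pulse(2048, 0.05, 1e-3), 5.0)` gives a 1 ms pulse with optimal SNR 5 inside a 2048-bin window.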

The last waveform is a supernova signal from the ZM catalogue (number 6 in order of decreasing simulated signal
energy) [8]. The three signals are displayed in Figure 2. The amplitude A of each of these three signals is calibrated
according to the corresponding optimal SNR $\rho_0$ (if one of the signals were detected by optimal filtering in the same
noise conditions, the mean optimal SNR would be $\rho_0$). For the Monte Carlo simulations, we have used a data
window of 2048 bins. For each simulated data window, we first run the filters on noise only. If one of the filters
is triggered, we increment its false alarm counter; otherwise we add one of the signals to the noise and check whether
the filter detects the signal, in which case its detection counter is incremented. The efficiency of a filter in detecting
one of the signals is then the ratio of the number of detections to the number of noise realisations without a false
alarm. Meanwhile, the false alarm rate is the ratio of the number of noise realisations with a false alarm to the total number of
noise realisations, divided by the data window size, which gives a false alarm rate per bin. The data window
size (2048 bins) has been chosen large enough to contain the signals, but small enough to keep a very
low probability of having more than one false alarm in a single window. The latter may obviously occur only for very high
false alarm rates (not reached in practice). We first study the case of optimized filters (only one "template", matched
to the signal we consider) in order to obtain the maximal efficiencies the filters can reach. We then
study the case of realistic implementations, with several "templates" working in parallel. It is worth noting that the
"event" notion introduced in [37] is automatically taken into account here, as the detection algorithm stops (and the
detection counter is incremented) as soon as the filter output is above threshold. The method is thus independent of
the actual number of bins above threshold and of the details of event clustering, as defined in [37].
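The counting procedure described above can be sketched as a single ROC point estimator. This is a hedged illustration, not the code used for the figures: `filter_fn` is a hypothetical callable mapping a data window to normalised filter outputs, a trigger is taken to be any output above the threshold, and the number of trials is kept small for speed.

```python
import numpy as np


def roc_point(filter_fn, signal, threshold, n_trials=2000, window=2048, seed=0):
    """Estimate (efficiency, false alarm rate per bin) for one threshold."""
    rng = np.random.default_rng(seed)
    false_alarms = 0
    detections = 0
    for _ in range(n_trials):
        noise = rng.standard_normal(window)
        if np.max(filter_fn(noise)) > threshold:
            false_alarms += 1                     # triggered on noise only
        elif np.max(filter_fn(noise + signal)) > threshold:
            detections += 1                       # signal added and detected
    quiet = n_trials - false_alarms               # realisations without false alarm
    efficiency = detections / quiet if quiet else float("nan")
    false_alarm_rate = false_alarms / n_trials / window
    return efficiency, false_alarm_rate
```

Scanning the threshold and plotting efficiency against false alarm rate then traces out one ROC curve per filter and per signal.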
B. ROC for optimized filters



We first consider optimized filters, in the sense that their parameters (essentially the moving window size N) are
optimally matched to the signal. We consider for this purpose the Gaussian pulse signal (Eq. 3.1). With w = 1 ms,
the filters are matched with window sizes N = 40 (MF and NF) or N = 140 (SF, OF and ALF). The ROC for various
optimal SNRs are shown in Figures 3, 4 and 5 ($\rho_0$ = 5, 7.5 and 10 respectively). For each SNR value, MF and ALF
show very close efficiencies. While much simpler, MF is able to compete with ALF for detecting pulse-like signals.
However, this is the ideal situation, all filters being optimally implemented with respect to the signal. We will see how
this is modified with realistic implementations in the next section. Concerning ALF, it is again clear that we gain
by combining OF and SF (in ALF), as OF and SF are always less efficient than ALF, whatever the false alarm rate or the
signal strength. Between OF and SF, OF is always significantly more efficient than SF. Finally, NF appears
uncompetitive for detecting short pulses, even in its optimized version. In particular, in the case of low SNR ($\rho_0$ = 5),
the NF efficiency is close to zero for practical false alarm rates in the interferometers (say $< 10^{-6}$). For higher SNR
($\rho_0$ = 10), NF cannot reach 50% efficiency in the false alarm rate range we study. On the contrary, for such an SNR,
MF and ALF have efficiencies near 100% over the whole range of false alarm rates.

C. ROC for practical implementations of the filters

In the previous section, the filters were matched to a single pulse width. In the real world, however, the width of the signal
(if any) will not be known in advance. Moreover, the signal itself will surely not be a perfect Gaussian pulse.
That is why the filters must be implemented with different "templates" in order to cover the signal
parameter space adequately. For all the filters here, the only parameter is the moving window size. Thus, in practice, the filters
will be implemented (for instance in the online trigger system) with different moving windows in parallel. Such an
implementation is shown in Table 2, where the typical burst width ranges from 0.5 ms to 10 ms. For NF and MF the
window sizes correspond to the signal widths $\Delta t$ ($N = \Delta t \times f_0$), while for ALF, SF and OF they correspond to about
3.5 times the signal widths [37].



Signal size (ms)   0.5   0.75   1    1.25   1.5   2     2.5   3.5   7.5   10
MF, NF             10    15     20   25     30    40    50    70    150   200
SF, OF and ALF     35    50     75   90     105   140   175   250   500   750

Table 2: Choice of the 10 window sizes (in bins) to be implemented in parallel for MF and NF, and for SF, OF and ALF. The
corresponding typical signal widths are also given.
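Running several window sizes in parallel can be sketched as follows for the Mean Filter, using the window sizes of Table 2. The trigger rule assumed here (a one-sided threshold on the largest normalised output over all templates) is a simplification chosen for the example, and the function names are hypothetical.

```python
import numpy as np

# Window sizes (in bins) of the MF and NF row of Table 2.
MF_WINDOWS = (10, 15, 20, 25, 30, 40, 50, 70, 150, 200)


def mf_bank_max_snr(x: np.ndarray, windows=MF_WINDOWS) -> float:
    """Largest normalised Mean Filter output over all window sizes."""
    csum = np.concatenate(([0.0], np.cumsum(x)))
    best = -np.inf
    for n in windows:
        # sqrt(n) * moving average = windowed sum / sqrt(n)
        snr = (csum[n:] - csum[:-n]) / np.sqrt(n)
        best = max(best, float(snr.max()))
    return best


def bank_triggers(x: np.ndarray, threshold: float, windows=MF_WINDOWS) -> bool:
    """True if at least one template exceeds the threshold."""
    return mf_bank_max_snr(x, windows) > threshold
```

Because the bank output is a maximum over templates, it can only increase when templates are added, for signal and noise alike. This is the mechanism by which mismatched templates raise the overall false alarm rate at fixed threshold.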
The ROC for the Gaussian pulse of half-width 1 ms are shown in Figure 6. The optimal SNR is 5 (the most
interesting value to exhibit the differences between filters at the false alarm rates considered). Note that the signal width corresponds
exactly to one of the window sizes of the implementation (N = 40 for NF and MF and N = 140 for the slope filter
family). When compared to Figure 3, the realistic situation differs from the ideal one, in particular for MF. ALF is
the most efficient, followed by OF and SF, while NF is always the worst. The case of MF is interesting: its efficiency
is excellent if matched to the pulse width, but it decreases dramatically if implemented with several "templates". The
matched template (N = 40 here) is still as efficient in detecting the pulse, but the other templates contribute to
increasing the number of false alarms. For the other filters, the increase in false alarms due to the mismatched templates
is much smaller. This can also be shown by comparing the false alarm rates $\kappa_{50}$ at which the filter efficiency reaches
50%. Note that $\kappa_{50}$ can also be a good quality criterion for comparing the different filters. The results are shown in
Table 3.



Filter              ALF                  OF                   SF                   MF                   NF
Optimized filters   $10^{-7}$            $2 \times 10^{-7}$   $2 \times 10^{-5}$   $10^{-7}$            $> 2 \times 10^{-4}$
Realistic filters   $2 \times 10^{-7}$   $3 \times 10^{-7}$   $4 \times 10^{-5}$   $4 \times 10^{-5}$   $4 \times 10^{-4}$

Table 3: False alarm rates $\kappa_{50}$ at which the filter efficiency reaches about 50% for a Gaussian pulse of half-width 1
ms and optimal SNR $\rho_0$ = 5. The values are extracted from Figures 3 (optimized filters) and 6 (realistic filters).

The sensitivity of the overall false alarm rate to the number of templates is thus a problem for MF, while the other
filters seem much more robust in this respect. Indeed, their efficiency curves shift only slightly to the right
if we compare Figures 6 and 3, and their $\kappa_{50}$ changes only by a factor of about 2. For MF, the shift is very
large and its $\kappa_{50}$ changes by a factor of about 400.

The ROC for the damped sine signal are plotted in Figure 7. The situation is completely different from the previous one.
Compared to Figure 6, MF has about the same efficiency, while NF is significantly better. But for this signal, ALF
and its parent filters are the worst. The 50% efficiencies are obtained for false alarm rates $\kappa_{50} \simeq 4 \times 10^{-5}$ for MF (about
the same as for the Gaussian signal), $\kappa_{50} \simeq 2 \times 10^{-4}$ for NF (twice as good) and about $\kappa_{50} \simeq 3 \times 10^{-4}$ for ALF, that
is $10^3$ times worse than for the Gaussian signal. In this configuration, ALF is not competitive, while MF is the best filter in
the list.

Finally, the ROC for the supernova signal are shown in Figure 8. We roughly find the hierarchy first obtained with
the Gaussian pulse. ALF and related filters are the most efficient, MF comes next and the least efficient is NF. The
50% efficiencies are obtained here for false alarm rates of about $\kappa_{50} \simeq 3 \times 10^{-5}$ for ALF, OF and SF, $\kappa_{50} \simeq 7 \times 10^{-5}$ for
MF and $\kappa_{50} \simeq 5 \times 10^{-4}$ for NF.
D. Discussion



The first point to mention is that the relative efficiencies of the different filters depend strongly on the type of
waveform. For instance, ALF is not suited to the detection of the damped sine signal above. MF and NF have very
roughly the same detection efficiencies whatever the signal, so in this sense these filters are robust. However, ALF can
be much more efficient than MF and NF. In all cases, we find that ALF is more efficient than SF and OF: we
do gain a lot by combining SF and OF in ALF. In its optimized version (N matched to the signal size), MF
can be as efficient as ALF (see Figure 3), but its efficiency falls dramatically in a practical implementation. MF
is however the most efficient for the damped sine signal. NF is never the most efficient for any signal, but it appears
more efficient than ALF in detecting the damped sine signal. All these results are summarized in Table 4, where
the false alarm rates for 50% detection efficiency are given for all the signals.





Signal                    ALF                      MF                       NF
Gauss (ideal)             $10^{-7}$ (★)            $10^{-7}$ (★)            $> 2 \times 10^{-4}$
Gauss (realistic)         $2 \times 10^{-7}$ (★)   $4 \times 10^{-5}$       $4 \times 10^{-4}$
Damped sine (realistic)   $3 \times 10^{-4}$       $4 \times 10^{-5}$ (★)   $2 \times 10^{-4}$
ZM (realistic)            $3 \times 10^{-5}$ (★)   $7 \times 10^{-5}$       $5 \times 10^{-4}$

Table 4: False alarm rates $\kappa_{50}$ at which the filter efficiency reaches about 50% for the 3 signals with optimal SNR
$\rho_0$ = 5. For each signal, the best $\kappa_{50}$ value is marked (★). The results for OF and SF are not reported in the table,
since their detection efficiencies are always lower than those of ALF (by construction).

It may appear surprising that MF performs better than NF or ALF in detecting the damped sinusoidal waveform,
as the mean of a sinusoid is 0. In fact, MF is efficient because the width of one of its "templates" is well adapted to detecting
one of the signal peaks. A cut-off frequency may be associated with each of the templates of length N through
$f_c = f_0/N$, where $f_0$ is again the sampling frequency. For the implementation shown in Table 2, the largest cut-off
frequency (associated with the lowest value of N) is $f_c^{max} = 1$ kHz. Below this frequency, there will always be at
least one template short enough to pick out only one signal peak. On the contrary, above $f_c^{max}$, all the templates
average over the signal cycles and the MF output decreases dramatically. A similar behaviour is found for ALF, due to
the fact that the mean slope of a sinusoid with many cycles is zero. This can be seen in Figure 9, where the detection
efficiencies of MF, NF and ALF for the same false alarm rate are shown as a function of the frequency of a damped
sine signal of damping time $\tau$ = 100 ms (long enough to have many cycles when the frequency is high enough). We
see clearly that ALF and MF are very efficient at low frequency, while they dramatically lose efficiency as the signal
frequency increases. On the contrary, the NF efficiency is almost constant, whatever the signal frequency. This shows
some robustness for the NF. We note also that for the signal frequency f = 1 kHz, MF is again more efficient than
NF and ALF, as in Figure 7, while its efficiency tends towards 0 as the signal frequency becomes larger and larger.
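The averaging effect can be made quantitative with the amplitude response of a moving average to a pure sinusoid, which is a standard result: the gain is $|\sin(\pi f N / f_0) / (N \sin(\pi f / f_0))|$ and it first vanishes at $f = f_0/N$. The sketch below only evaluates this textbook expression; the function name is hypothetical and the damping of the signal is ignored.

```python
import numpy as np


def boxcar_gain(freq_hz, n, f0=20_000.0):
    """Amplitude gain of an n-sample moving average for a sinusoid at freq_hz."""
    x = np.pi * np.asarray(freq_hz, dtype=float) / f0
    with np.errstate(divide="ignore", invalid="ignore"):
        gain = np.abs(np.sin(n * x) / (n * np.sin(x)))
    # At f = 0 (and multiples of f0) the ratio is 0/0 and the gain is 1.
    return np.where(np.isclose(np.sin(x), 0.0), 1.0, gain)
```

Evaluating `boxcar_gain` over a frequency grid for each window size of a template bank shows which templates still respond to a given signal frequency and which ones average the cycles away.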



IV. TIMING ISSUES

The filter timing properties are very important since timing accuracy is necessary (1) for validating coincidences
between GW detectors, and (2) for reconstructing the signal. For instance, the signal time delay between VIRGO and
LIGO-Hanford is up to 27 ms. Thus, the error on the time of arrival of a signal estimated with a given
filter must be much smaller than this delay. This timing accuracy is also crucial in the case of coincidences with neutrino
detectors. In the latter case, the GW timing accuracy needs to be better than 1 ms in order not to limit the measurement of
neutrino masses [45].

A. Methodology

The definition of a time-of-arrival estimator can in general be non-trivial, since it can depend both on the filter
and on the signal waveform. That is why we will use a simple waveform in order to evaluate the optimal
performance of a filter in measuring a signal time of arrival, keeping in mind that for a real signal the timing accuracy
should be degraded.

In order to evaluate the timing accuracy of the filters, we proceed as follows. We consider as a burst signal a
Gaussian pulse (Eq. 3.1) with a variable amplitude A (always calibrated according to the optimal SNR $\rho_0$) and a time
width w. The signal is buried in white Gaussian noise and the filters try to detect it. We define the expected time
of arrival as the maximum of the pulse, which occurs at $t_0$ for the signal of Eq. (3.1). The first arrival time estimator
considered for the filters is given by the SNR maximum. A different estimator will be used for ALF. After
many noise realisations, we can thus estimate the systematic bias $\Delta t$ (mean of the distribution of measured times of
arrival), if any, and the statistical error $\sigma_t$ (RMS of the distribution) on $t_0$ for a given set of parameters (w and $\rho_0$).

In the case of optimal filtering (here the correlation of the noisy Gaussian pulse with an identical Gaussian pulse
template, which is nothing but the Peak Correlator [14] with a single template), no systematic bias is found and the
statistical error is [46]:

$$\sigma_t \simeq 0.15 \, \mathrm{ms} \left( \frac{w}{1 \, \mathrm{ms}} \right) \left( \frac{\rho_0}{10} \right)^{-1} \qquad (4.1)$$

We note that the statistical error is linear with respect to both w and $1/\rho_0$. This timing accuracy is in fact the best
that can be achieved (optimal filtering). We see that for the canonical example, w = 1 ms and $\rho_0$ = 10, $\sigma_t \simeq 0.15$
ms, well below the time delay between interferometers or the required accuracy for coincidences with neutrinos. The
question is then whether the suboptimal filters studied here retain acceptable timing accuracies. The bias and
statistical errors on the measurement of $t_0$ depend a priori on the amplitude $\rho_0$ and on the width w of the pulse signal.
For every filter, we will therefore study both $\Delta t$ and $\sigma_t$ as functions of $\rho_0$ and w. We will use the matched versions of
the filters (window size N matched to the signal width w), so the numbers given are to be considered as limits,
namely the best achievable in principle with the filters.
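The Monte Carlo procedure can be sketched for a matched moving-average filter. This is an illustration under explicit assumptions, not the code behind the figures: the pulse is taken as $\exp(-(t - t_0)^2 / (2 w^2))$, the window length is set to twice the half-width, the arrival time is estimated from the position of the maximum output, and the half-window offset is removed by referring each output to the centre of its window. The function name and defaults are hypothetical.

```python
import numpy as np


def timing_stats(rho0=10.0, w_ms=1.0, f0=20_000.0, n_trials=500,
                 window=2048, seed=0):
    """Return (bias, statistical error) in ms for a matched moving average."""
    rng = np.random.default_rng(seed)
    w = w_ms * 1e-3 * f0                     # pulse half-width in samples
    n = int(round(2 * w))                    # matched window (assumption: N = 2w)
    i0 = window // 2                         # true arrival sample
    i = np.arange(window)
    s = np.exp(-((i - i0) ** 2) / (2.0 * w ** 2))
    s *= rho0 / np.sqrt(np.sum(s ** 2))      # calibrate to optimal SNR rho0
    errors_ms = np.empty(n_trials)
    for trial in range(n_trials):
        x = s + rng.standard_normal(window)
        csum = np.concatenate(([0.0], np.cumsum(x)))
        out = csum[n:] - csum[:-n]           # window k covers samples k .. k+n-1
        k_max = int(np.argmax(out))
        t_hat = k_max + (n - 1) / 2.0        # window centre: removes the offset
        errors_ms[trial] = (t_hat - i0) / f0 * 1e3
    return errors_ms.mean(), errors_ms.std()
```

Repeating the call for several values of `rho0` and `w_ms` and fitting straight lines on log-log scales gives the kind of scaling laws discussed in the following subsections.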

B. Norm Filter 

For the Norm Filter, we first find a systematic bias $\Delta t = \frac{N}{2} \times$ sampling time. This shift is simply related to the
window size N and can be easily corrected for. The correction can itself be incorporated in the filter definition, so it
plays no role. The statistical error has less trivial relations with the signal parameters. Figures 10 and 11 (left panel)
show the behaviour of $\sigma_t$ as a function of $\rho_0$ and w respectively. The statistical error does not behave linearly either
with $1/\rho_0$ or with w, contrary to the optimal filter. On log-log scales, however, the curves are linear and the slopes
can be obtained from a least squares fit. They differ substantially from unity and are of course larger than the (plus
or minus unity) slopes found in the case of the optimal filter. The results can be combined into a single expression for
the statistical error:

$$\sigma_t \simeq 0.273 \, \mathrm{ms} \left( \frac{w}{1 \, \mathrm{ms}} \right)^{[\ldots]} \left( \frac{\rho_0}{10} \right)^{-0.72} \qquad (4.2)$$

C. Mean Filter 

For the Mean Filter, we find exactly the same systematic effect as for NF, $\Delta t = \frac{N}{2} \times$ sampling time. Again, this
bias can be corrected for and is unimportant. The statistical error $\sigma_t$ is first found to behave linearly with respect to
the signal width w (so better than NF). But, as for NF, $\sigma_t$ is not linear with respect to $1/\rho_0$, as shown in Figure 11
(mid panel). The slope of the curve $\sigma_t$ vs $\rho_0$ on log-log scales is about -0.68, well above the value of -1 found for the optimal filter,
and slightly worse than NF. We can combine the results into a single expression:

$$\sigma_t \simeq 0.25 \, \mathrm{ms} \left( \frac{w}{1 \, \mathrm{ms}} \right) \left( \frac{\rho_0}{10} \right)^{-0.68} \qquad (4.3)$$

We note that for the canonical values $\rho_0$ = 10 and w = 1 ms, we obtain $\sigma_t \simeq 0.25$ ms, about the same as for NF
(about 0.27 ms), which is of course larger than the optimal filter statistical error, but not much larger.
D. ALF and related filters



For SF and ALF, the response to a pulse signal shows two peaks, as seen in Figure 1. The peak maxima are in
principle symmetric with respect to $t_0$. We consequently define the new time-of-arrival estimator as $t_0 = (t_1 + t_2)/2$,
where $t_1$ and $t_2$ are the time locations of the two peaks. For OF the situation is 'normal' (a single maximum), as for
NF and MF, and the time-of-arrival estimator is not modified. For the 3 filters SF, OF and ALF, we find again the
usual systematic bias $\Delta t = \frac{N}{2} \times$ sampling time, which we can correct for. The behaviour of the statistical error $\sigma_t$ is
then found to be linear with respect to the signal size w, even for the non-linear filter ALF. Finally, as for NF and MF,
the three filters do not have a linear behaviour with respect to $1/\rho_0$. Figure 11 (right panel) shows the curve
$\sigma_t$ vs $\rho_0$ for ALF (results for SF and OF are not displayed). In each case, the slope on log-log scales is well above -1
and similar to what has been found for NF and MF, that is around -0.7. The results are combined into the following
formulas:

$$\sigma_t^{SF} \simeq 0.244 \, \mathrm{ms} \left( \frac{w}{1 \, \mathrm{ms}} \right) \left( \frac{\rho_0}{10} \right)^{-0.69} \qquad (4.4)$$

$$\sigma_t^{OF} \simeq 0.311 \, \mathrm{ms} \left( \frac{w}{1 \, \mathrm{ms}} \right) \left( \frac{\rho_0}{10} \right)^{-0.68} \qquad (4.5)$$

$$\sigma_t^{ALF} \simeq 0.25 \, \mathrm{ms} \left( \frac{w}{1 \, \mathrm{ms}} \right) \left( \frac{\rho_0}{10} \right)^{-0.71} \qquad (4.6)$$

E. Discussion 

The first and important point is that all the suboptimal filters studied in this paper have good timing accuracies.
The systematic biases on the time of arrival are trivial and can be easily corrected for. The statistical errors are of
course larger than in the case of optimal filtering, but are still acceptable. For the canonical example, w = 1 ms and
$\rho_0$ = 10 (see Table 5), the statistical error is about 0.15 ms for optimal filtering, around 0.25 ms for ALF, SF, MF
and NF, and the worst value is obtained for OF, about 0.3 ms, that is twice the optimal value. In any case,
in this example, the timing accuracy is well below 1 ms for all the filters.



Filter            Optimal   ALF    OF     SF     MF     NF
$\sigma_t$ (ms)   0.15      0.25   0.31   0.24   0.25   0.27

Table 5: Statistical errors on the time-of-arrival estimation for a Gaussian pulse with width w = 1 ms and amplitude
normalised to $\rho_0$ = 10.

We then note that all linear filters behave linearly with respect to the signal width. Among the non-linear filters, ALF
also displays such a behaviour (as a descendant of two linear filters), while NF does not. All the suboptimal filters
have a non-linear behaviour with respect to $1/\rho_0$, the best one here being NF (slope closest to -1), but the difference
between the filters is not really significant, as the slopes range from -0.68 for MF and OF to -0.71 for ALF and -0.72
for NF.

Finally, balancing all the aspects, the best filter concerning timing accuracy seems to be ALF, NF being penalized by
its non-linearity with respect to the signal width. This conclusion was not a priori obvious, considering the definition
and the broad response of the ALF filter (see Figure 1). However, this optimistic conclusion should be moderated.
Indeed, this result has been obtained with a simple waveform, a single peak. With a more structured signal (a
supernova signal), the situation is in fact less favourable. The first problem is to properly define a time-of-arrival
estimator. Different estimators have been tested in a Monte Carlo simulation for determining the time of arrival of
ZM signals with ALF [39]: time of maximum SNR, time of the first bin above threshold, average time between two SNR
peaks, etc. All of them have been found to be biased. Moreover, the bias may strongly depend on the waveform type
(type I, II or III as classified by Zwerger and Müller [8]). The smallest bias, averaged over all the waveforms of the
ZM catalogue, is about 0.5 ms for a signal SNR of 5. Type III signals make the largest contribution to this bias (with
biases around 1.6 ms on average). This shows that the ALF timing accuracy (bias + statistical error) can in some
cases be significantly larger than 1 ms. This can have serious consequences, for example, in the case of coincidences
with neutrino detectors. Indeed, if the GW timing accuracy is worse than about 1 ms, the determination of neutrino
masses is degraded [45].

V. EFFECT OF LINES ON THE FILTER FALSE-ALARM RATES 

The suboptimal filters studied in this paper require a pre-whitening of the noise. In reality, noise whitening is never
perfect and we need to quantify the filters' sensitivity to imperfect whitening. This has already been done for ALF
[37] but needs to be extended to the other filters. For this purpose, about 100 hours of Gaussian white noise data are
simulated and a single frequency component of the form A sin(2πft) (that can mimic a large line residual) is added.
We then filter the data (Gaussian white noise + line) with the different algorithms (including ALF, in order to be able
to compare the filters in exactly the same situations) using sliding windows and record the effective number of false
alarms for each one as a function of A and f. In the following, a nominal false alarm rate of 10^-6 per bin is chosen. So for 100
hours of data sampled at 20 kHz about 7200 ± 85 false alarms are expected (the standard deviation of about 85 is
estimated from a binomial law). For instance, we find n0 = 7138 false alarms with the MF for the 100 hours of pure
Gaussian white noise. For each filter, we then increase the sine amplitude A, measure the new number of false alarms n
each time and compute the relative excess of false alarms with respect to the ideal situation (A = 0), i.e. the quantity
(n - n0)/n0. The results for NF, MF and ALF are given in Figures 12, 13 and 14 respectively. We investigate as
examples four different frequencies: 0.6 Hz, corresponding to the natural pendulum mode frequency of the suspended
mirrors in VIRGO, and 100 Hz, 200 Hz and 400 Hz, corresponding to power line harmonics or wire resonances. The filters'
window sizes have been chosen to be N = 50 for both NF and MF and N = 170 for ALF, so that all window sizes
correspond to the same matched signal size (about 2.5 ms).

The results for the three filters are quite different. For
NF the curves for the false alarm excess vs A are about the same whatever the line frequency. In contrast, for MF
and ALF, the false alarm excess strongly depends on the line frequency, from a very large excess at low frequency to
a lower or even vanishing excess at higher frequencies. For MF we find again the effect of the cut-off frequency associated with
the window size N, f_c = f0/N. Above this cut-off frequency, the line no longer increases the number of false
alarms and has simply no effect on MF performance since, by construction, MF averages the fast oscillations in the
window and the net effect is zero. The same phenomenon also appears, but to a lesser extent, for ALF.

We can now
set a specification for the whitening procedure such that the excess in false alarm rate does not exceed the nominal
value by more than 10%. For NF this implies that the whitened line amplitudes must be lower than 7-8% of the ideal
Gaussian white noise RMS σ for all frequencies. For MF and ALF, the effect depends on the frequency. For example,
for the important 0.6 Hz frequency in VIRGO, the specification is that the line amplitude should be less than about
2% of the noise RMS for MF and about 1% for ALF. For increasing frequencies the specifications become less and
less stringent until they can be totally relaxed above the cut-off frequency. Such specifications (from 1% to 10% of
ideal noise RMS for line amplitudes) seem quite severe. Let us define the noise flatness [41]

\xi = \frac{\exp\left[\frac{2}{f_0}\int_0^{f_0/2} \ln S_h(f)\,df\right]}{\frac{2}{f_0}\int_0^{f_0/2} S_h(f)\,df} \qquad (5.1)

where S_h(f) is the noise one-sided power spectral density (PSD). The flatness is such that ξ ∈ [0; 1], the extremal
values being reached for a very peaky PSD (ξ ≈ 0) and for a white PSD (ξ ≈ 1). The most stringent requirement
(for the 0.6 Hz pendulum mode) corresponds to a maximal line amplitude at about 1% of the background Gaussian
and white noise. If we convert this in terms of noise flatness we obtain a specification [39] that reads ξ > 0.97. This
may seem very demanding at first sight, but it is within the reach of existing whitening algorithms that can already
achieve a level of ξ > 0.98 [41].
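
As an illustration of how a flatness figure of this kind can be evaluated numerically, the sketch below assumes the usual spectral-flatness convention (geometric mean of the PSD divided by its arithmetic mean) applied to PSD samples on a uniform frequency grid. The function name `flatness` and the toy spectra are hypothetical and are not taken from [39] or [41]; in practice a smoothed PSD estimate would be needed, since a single raw periodogram biases the geometric mean low.

```python
import math

def flatness(psd):
    """Spectral flatness of PSD samples taken on a uniform frequency grid.

    Assumed convention: geometric mean divided by arithmetic mean, so the
    result lies in (0, 1] and equals 1 only for a perfectly flat PSD.
    """
    if not psd or any(s <= 0.0 for s in psd):
        raise ValueError("PSD samples must be strictly positive")
    log_mean = sum(math.log(s) for s in psd) / len(psd)
    arith_mean = sum(psd) / len(psd)
    return math.exp(log_mean) / arith_mean

# Toy spectra (illustrative values only).
white = [1.0] * 1000              # perfectly white PSD
peaky = [1.0] * 999 + [1.0e6]     # white floor plus one very strong line

print(flatness(white))
print(flatness(peaky))
```

With these toy inputs the white spectrum gives a flatness of 1 and the spectrum dominated by a single strong line gives a value close to 0, which matches the two limits quoted above.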

Asking for no more than 10% extra false alarms due to imperfect noise whitening is in fact not very demanding 
as far as detection probability is concerned. Indeed, we can correct for the false alarm excess of 10% (in order to 
recover the desired false alarm rate) by fine tuning of the detection thresholds. But relaxing the requirements to 20% 
or 50% does not really change the situation at least for the low frequency lines (see Figures 15-16). For MF and ALF 
the specifications on low frequency line amplitudes will always be a few % of the background noise RMS since the 
increase of false alarms is a very sharp function of the line amplitude. 
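
The frequency dependence found for MF can be illustrated with a short sketch. It assumes a plain, unnormalised moving average over N = 50 bins at a sampling frequency of 20 kHz, and applies it to a noiseless unit-amplitude line. The helper names `moving_average` and `line_residual` are hypothetical, and the exact MF normalisation used in the paper is not reproduced here.

```python
import math

def moving_average(x, n):
    """Mean of each length-n sliding window of x (running-sum implementation)."""
    acc = sum(x[:n])
    out = [acc / n]
    for i in range(n, len(x)):
        acc += x[i] - x[i - n]
        out.append(acc / n)
    return out

def line_residual(freq_hz, n=50, fs=20000.0, n_samples=40000):
    """Largest absolute moving-average output for a unit-amplitude sine line."""
    line = [math.sin(2.0 * math.pi * freq_hz * k / fs) for k in range(n_samples)]
    return max(abs(v) for v in moving_average(line, n))

# The four line frequencies studied in the text.
for f in (0.6, 100.0, 200.0, 400.0):
    print(f, line_residual(f))
```

At 400 Hz = f0/N each window contains exactly one period of the line, so the average vanishes up to rounding errors, while the 0.6 Hz line passes through almost unattenuated. This is the mechanism behind the tight low-frequency specification for MF.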

Of course the number of false alarms considered so far is the raw number, i.e. the number of outputs above
threshold. However, the consecutive outputs of a filter are not independent: once a filter is triggered by noise
alone, we find in practice not a single output but a number of consecutive outputs above threshold, all corresponding to the
same false alarm event. We can thus redefine an event as proposed in [37]. A false alarm event is then no longer
a simple filter output above threshold but a cluster of successive outputs above threshold, separated by at most the
correlation length of the filter. For example, for the simulated 100 hours of data, without lines added, we find for
ALF (with threshold = 27.85) 7208 raw false alarms that are resolved into 1170 false alarm events, i.e. a reduction factor
of about 6. When lines are added, it has been checked that one order of magnitude on the false alarm rate is gained if we
take the new definition of a false alarm event. This can then relax somewhat the constraints on the line amplitudes.
As an example, for the 0.6 Hz line with amplitude A = 7.5 × 10^-2 × σ, we find more than 63,000 false alarms in the
100 hours of noisy data, giving only 6361 false alarm events, still below the nominal number of false alarms (7208) for
the 100 hours of data free of lines.
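
The event definition described above can be sketched as a simple clustering of the bins above threshold. This is a minimal illustration, not the implementation of [37]: the function name `cluster_events`, the toy trigger list and the value of `max_gap` (standing in for the correlation length of the filter, in bins) are assumptions made for the example.

```python
def cluster_events(trigger_bins, max_gap):
    """Group bin indices above threshold into false alarm events.

    Two successive triggers belong to the same event when they are
    separated by at most max_gap bins (assumed correlation length).
    """
    events = []
    for b in sorted(trigger_bins):
        if events and b - events[-1][-1] <= max_gap:
            events[-1].append(b)   # extend the current cluster
        else:
            events.append([b])     # start a new event
    return events

# Toy list of bins where a filter output exceeded its threshold.
triggers = [100, 101, 103, 500, 501, 9000]
events = cluster_events(triggers, max_gap=5)
print(len(triggers), "raw false alarms ->", len(events), "events")
# prints: 6 raw false alarms -> 3 events
```

The reduction factor is then the ratio of raw false alarms to events, here 2 for the toy list.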

To be complete, let us finally mention that the shapes of the curves of relative false alarm event excess vs line amplitude
are in fact very close to the previous curves (with the raw false alarm excess). That means that, if we first
set the desired absolute false alarm rate to some value and then set the specifications on the whitening quality by
again asking for no more than 10% false alarms in excess, we will find about the same specifications for the remaining line
amplitudes, whatever the false alarm definition.
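
The false alarm bookkeeping used throughout this section reduces to a few lines of arithmetic. The sketch below reproduces the expected count for 100 hours of data sampled at 20 kHz with a false alarm probability of 10^-6 per bin, using the binomial law quoted in the text, and evaluates the relative excess (n - n0)/n0. The function names are hypothetical, and the count 7852 is simply n0 = 7138 increased by about 10%, not a measured value.

```python
import math

def expected_false_alarms(duration_h, fs_hz, p_fa):
    """Binomial mean and standard deviation of the number of false alarms."""
    n_bins = duration_h * 3600.0 * fs_hz
    mean = n_bins * p_fa
    sigma = math.sqrt(n_bins * p_fa * (1.0 - p_fa))
    return mean, sigma

def relative_excess(n, n0):
    """Relative excess of false alarms with respect to the line-free count n0."""
    return (n - n0) / n0

mean, sigma = expected_false_alarms(100.0, 20000.0, 1.0e-6)
print(round(mean), round(sigma))   # prints: 7200 85

# MF count quoted in the text for line-free noise, in units of sigma.
print((7138 - mean) / sigma)

# Illustrative count about 10% above n0 = 7138 (not a measured value).
print(relative_excess(7852, 7138))
```

The MF count of 7138 lies within one standard deviation of the expected 7200, so it is compatible with the nominal rate.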
VI. CONCLUSIONS



In this paper, the principles of GW burst detection algorithms previously studied in our group have been recalled. 
The Mean filter, based on moving average, has been introduced. Their performances for detecting a set of simulated 
supernova signals have been completed and summarized. This was until now the only way to compare the different 
filters. We have then considered three other tools or criteria in order to fully understand and compare the filters. 
Firstly we have systematically made use of the conventional ROC in detecting typical burst signals. Secondly, the 
timing properties (systematic biases and statistical errors in timing reconstruction) have been evaluated. Finally we 
have studied the effect of a non perfect whitening (remaining spectral lines) on the effective false alarm rate of the 
filters. 

Concerning the old benchmark, ALF and the Mean Filter show the best performance in detecting the supernova 
signals of the Zwerger and Müller catalogue. When looking at the ROC, ALF and the Mean Filter are still ahead
provided they can be optimized, meaning that the signal length is known in advance. In practical situations, where 
banks of filters are used in order to cover the signal sizes space, the situation is not as clear. In particular, for short 
damped sine signals, ALF is much less efficient than the Mean Filter and even the Norm Filter. A characteristic 
feature of the Mean Filter as well as the Norm Filter is their relative robustness : their efficiency curves are very 
similar from one signal to another. ALF is much less robust; it is in general (much) better but in some cases it is the 
worst performing. Concerning the timing issues, all the filters have similar timing accuracies, worse (but no more than 
a factor 2) than the optimal filter timing accuracy. The main concern is maybe about the whitening quality which 
is required by most of the algorithms. Indeed the remaining line amplitudes are required to stay below a fraction 
of the ideal background Gaussian white RMS. Fortunately the redefinition of a false alarm event allows one to gain 
almost one order of magnitude on the effective resulting false alarm rate. Finally, taking everything into account, 
it is difficult to state that one filter is better than the others. Rather than establishing some hierarchy, as done for 
example with our previous benchmark, the important conclusion of this paper is that the different filters are in fact 
complementary. This is indeed manifest when we again look at the different ROC : a single filter cannot be the most
efficient for all burst signals. Then rather than using a single 'preferred' filter, it is advisable to operate a battery of 
different filters having their own qualities and defects. The next step is then to develop a strategy to find the best 
use of all the filters in the battery. 

As we have presented in this paper an unbiased way to estimate the performance of filters, we would like to suggest
its application in the context of development of GW burst detection methods. This could be very valuable in order
to directly compare the performances of different filters and their complementarity or redundancy.
FIG. 1. Responses of the filters to a Gaussian burst signal of half width 1 ms (upper left panel) embedded in noise (upper
right panel) which is assumed to be white Gaussian with zero mean and unity standard deviation. The 5 following plots show
the responses of the Norm Filter (NF), the Mean Filter (MF), the Slope Filter (SF), the Offset Filter (OF) and ALF. For each
filter the moving window size is chosen to be optimal : N = 40 (corresponding to the signal width N/f0 ≈ 2 ms) for NF and
MF and N = 140 for ALF and related filters. In this example, the optimal SNR is 10; the maximal NF SNR is about 6.9,
the maximal MF SNR is about 9.7, the maximal SF SNR is about 7.8 and the maximal OF SNR is about 9.3. The maximal
ALF (quadratic) SNR is here about 40. We note in each case the obvious time delay between the signal peak and the output
maximum. This will cause a trivial bias when we study the filters' time resolution (see section IV).



[Figure 2 panels: GAUSS (upper left), DAMPSIN (upper right), ZM (lower)]

FIG. 2. The signals used for the ROC. Upper-left : Gaussian pulse with half width 1 ms. Upper-right : damped sine with
frequency 1 kHz and damping time 1 ms. Lower : signal emitted by core collapse as simulated by Zwerger and Müller. In the
plots, each signal would have an optimal SNR ρ0 = 10 if added to a white noise with unity RMS.







FIG. 3. ROC for optimized filters. The signal is a Gaussian pulse of half-width 1 ms with optimal SNR ρ0 = 5. Black star
(★): ALF, white triangle (△): OF, white circle (○): SF, black triangle (▲): MF and white square (□): NF. The false alarm
rate is a false alarm probability per bin, as in all other ROC.

FIG. 4. ROC for optimized filters. The signal is a Gaussian pulse of half-width 1 ms with optimal SNR ρ0 = 7.5. The
symbols are ★ (ALF), △ (OF), ○ (SF), ▲ (MF) and □ (NF).

FIG. 5. ROC for optimized filters. The signal is a Gaussian pulse of half-width 1 ms with optimal SNR ρ0 = 10. The
symbols are ★ (ALF), △ (OF), ○ (SF), ▲ (MF) and □ (NF). For such a signal amplitude, ALF, OF and NF have efficiencies
very close to 1, even for very small false alarm rates.

FIG. 6. ROC for filters in a realistic implementation. The signal is a Gaussian pulse of half-width 1 ms with optimal SNR
ρ0 = 5. The symbols are ★ (ALF), △ (OF), ○ (SF), ▲ (MF) and □ (NF).

FIG. 7. ROC for filters in a realistic implementation. The signal is a damped sine of frequency 1 kHz and damping time
1 ms with optimal SNR ρ0 = 5. The symbols are ★ (ALF), △ (OF), ○ (SF), ▲ (MF) and □ (NF).

FIG. 8. ROC for filters in a realistic implementation. The signal is a supernova simulated signal with optimal SNR
The symbols are ★ (ALF), △ (OF), ○ (SF), ▲ (MF) and □ (NF).

FIG. 9. Detection efficiency of MF, NF and ALF in their practical implementation for a damped sine signal of damping
time τ = 100 ms and varying frequency. The signals' optimal SNR is ρ0 = 5 and the filter thresholds used here correspond
to a common false alarm rate of about 5 × 10^-4. We note the robustness of NF while ALF and MF efficiencies decrease with

FIG. 10. The NF statistical error for time of arrival reconstruction as a function of the signal duration in log-log scales. The
slope is about 1.08, larger than for the optimal filter as well as for all the other filters.

FIG. 11. The statistical errors for time of arrival reconstruction as a function of the signal amplitude ρ0 in log-log scales for
NF, MF and ALF. In the case of NF, the slope is about -0.72, worse than for the optimal filter, while it is about -0.68 for MF
(a little worse than NF) and about -0.71 for ALF (similar to NF).

FIG. 12. The relative false alarm excess for the Norm Filter (implemented with N = 50) for lines of different frequencies as
a function of the line amplitude measured in σ units (the Gaussian white noise RMS). The different frequencies are 0.6 Hz (▲),
100 Hz (▼), 200 Hz (○) and 400 Hz (□). We note that the false alarm excess is roughly independent of the line frequency.

FIG. 13. The relative false alarm excess for the Mean Filter (implemented with N = 50) for lines of different frequencies as
a function of the line amplitude measured in σ units (the Gaussian white noise RMS). The different frequencies are 0.6 Hz (▲),
100 Hz (▼), 200 Hz (○) and 400 Hz (□). We note that the false alarm excess decreases with increasing frequency, until it
completely vanishes for frequencies above the cut-off frequency corresponding to the window size N. Here N = 50 corresponds
to a time size of 2.5 ms and so to a cut-off frequency of 400 Hz.

FIG. 14. The relative false alarm excess for ALF (implemented with N = 170, which is matched to 2.5 ms signals as for MF
and NF) for lines of different frequencies as a function of the line amplitude measured in σ units (the Gaussian white noise
RMS). The different frequencies are 0.6 Hz (▲), 100 Hz (▼), 200 Hz (○) and 400 Hz (□). We note that the false alarm excess
decreases with increasing frequency.






